Ontology based Web Page Topic Identification
نویسندگان
چکیده
منابع مشابه
Ontology based Web Page Topic Identification
With the emergence of the web, lots of research efforts are made in the area of Web Mining. This paper proposes an automatic approach for automatic topic identification from the web pages. The contribution of this research is in the approach of automatic topic identification of web pages that can provide better results. The topic of the web documents is identified through ontological approach.
متن کاملWeb page language identification based on URLs
Given only the URL of a web page, can we identify its language? This is the question that we examine in this paper. Such a language classifier is, for example, useful for crawlers of web search engines, which frequently try to satisfy certain language quotas. To determine the language of uncrawled web pages, they have to download the page, which might be wasteful, if the page is not in the desi...
متن کاملWeb-page Indexing based on the Prioritize Ontology Terms
In this world, globalization has become a basic and most popular human trend. To globalize information, people are going to publish the documents in the internet. As a result, information volume of internet has become huge. To handle that huge volume of information, Web searcher uses search engines. The Web-page indexing mechanism of a search engine plays a big role to retrieve Web search resul...
متن کاملOntology Based Framework for Web Page Information Extraction
Nature of Web information is dynamic and irregular that’s why it is difficult to search and integrate information from the Web. The biggest task in making WWW data accessible to users/agents is extracting the data from Web pages. We take advantage of information in existing Web pages to creating structured data semi-automatically. Extraction of information from semi-structured or unstructured d...
متن کاملUse of Kolmogorov distance identification of web page authorship, topic and domain
Recently there has been an upsurge in interest in the use of information entropy measures for identification of similarities and differences between strings. Strings include text document languages, computer programs and biological sequences. This work deals with the use of this technique for author identification in online postings and the identification of WebPages that are related to each ot...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2014
ISSN: 0975-8887
DOI: 10.5120/14849-3211